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Description 

The present invention is in the field of human nnedicine, particularly in the treatment of obesity and disorders 
associated with obesity. More specifically, the present invention relates to anti-obesity proteins based oh chimpanzee 

5 leptin. 

Obesity, and especially upper body obesity, is a common and very serious public health problem in the United 
States and throughout the world. According to recent statistics, more than 25% of the United States population and 
27% of the Canadian population are ovenweight. Kuczmarski. Amer. J. of Clin. Nutr. 55: 4958 - 5028 (1992); Reeder 
et. al. . Can. Med. Ass. J. . 23: 226-233 (1 992). Upper body obesity is the strongest risk factor known for type II diabetes 

10 mellitus, and is a strong risk factor for cardiovascular disease and cancer as well. Recent estimates for the medical 
cost of obesity are $150,000,000,000 world wide. The problem has become serious enough that the surgeon general 
has begun an initiative to combat the ever increasing adiposity rampant in American society. 

Much of this obesity induced pathology can be attributed to the strong association with dyslipidemia, hypertension, 
and insulin resistance. Many studies have demonstrated that reduction in obesity by diet and exercise reduces these 

IS risk factors dramatically. Unfortunately, these treatments are largely unsuccessful with a failure rate reaching 95%. 
This failure may be due to the fact that the condition is strongly associated with genetically inherited factors that con- 
tribute to increased appetite, preference for highly caloric foods, reduced physical activity, and increased lipogenic 
metabolism. This indicates that people inheriting these genetic traits are prone to becoming obese regardless of their 
efforts to combat the condition. Therefore, a pharmacological agent that can correct this adiposity handicap and allow 

20 the physician to successfully treat obese patients in spite of their genetic inheritance is needed. 

The ob/ob mouse is a model of obesity and diabetes that is known to carry an autosomal recessive trait linked to 
a mutation in the sixth chromosome. Recently, Yiying Zhang and co-workers published the positional cloning of the 
mouse gene linked with this condition. Yiying Zhang et al. Nature 372 : 425-32 (1 994). This report disclosed the murine 
and human protein expressed in adipose tissue. Likewise, Murakami et al., in Biochemical and Biophysical Research 

25 Communications 209(3) :944-52 (1995) report the cloning and expression of the rat obese gene. The protein, which 
is encoded by the ob gene, has demonstrated an ability to effectively regulate adiposity In mice. Pelleymounter et al., 
Science 269 : 540-543 (1 995). 

A parenteral formulation containing insoluble protein causes problems relating to inconsistency In the dose-re- 
sponse as well as unpredictability. The unpredictability is believed to be due to greater variability in the pharamacok- 

30 inetics in suspension formulations. The Insoluble formulations must first dissolve prior to adsorption. It is hypothesized 
that this step has significant varaibility in a subcutaneous depot. 

Unfortunately, the naturally occurring obesity proteins demonstrate a propensity to aggregate and cause a re- 
sponse at the site of injection making the preparation of a soluble, pharmaceutically acceptable parenteral formulation 
exceedingly difficult. The molecular interactions amongst the preservative, buffer, ionic strength, pH, temperature, and 

35 any additional excipients such as a surfactant, or sugar are highly unpredictable in view of the propensity for the obesity 
protein to aggregate and precipitate from the formulation. 

Obesity proteins have been developed and have demonstrated pharmacological activity. Some of these proteins 
demonstrate significant improvement in physical properties and stability. Such proteins are disclosed in W096/23517. 
The present invention provides obesity proteins based on the chimpanzee leptin sequence disckjsed in U.S. ap- 

40 plication number 60/003,935 and conditions under which solubility of such proteins are enhanced. Thus, the invention 
provides longer shelf life, ease of manufacture, and more convenient patient delivery. Most unexpectedly, the physical 
stability of the fomnulation is greatly enhanced in the presence of alkyl parabens such as methylparaben, ethylparaben, 
propylparaben, and butylparaben, as well as chlorobutanol or a mixture thereof. That is, when formulated under the 
conditions described herein, these novel obesity proteins remain soluble at much higher concentratbns and at a pH 

45 range acceptable for a soluble, multi-use parenteral formulation. Accordingly, the present Invention provides soluble, 
parenteral formulations of novel obesity proteins based on chimpanzee leptin. 

Proteins useful for treating obesity selected from SEQ ID NOs 1 -1 3 are claimed. Soluble parenteral formulations, 
comprising an obesity protein and a preservative selected from the group consisting of alkylparaben, chlorobutanol, 
or a mixture thereof are also claimed. 

so For purposes of the present invention, as disclosed and claimed herein, the following terms and abbreviations are 

defined as follows: 

Alkylparaben - refers to a C^ to C4 alkylparaben. Preferably, alkylparaben is methylparaben, ethylparaben. pro- 
pylparaben, or butylparaben. 

Base pair (bp) refers to DNA or RNA. The abbreviations A,C,G, and T correspond to the 5'-monophosphate 
ss forms of the nucleotides (deoxy)adenine, (deoxy)cytidine, (deoxy) guanine, and (deoxy)thymine, respectively, when 
they occur in DNA molecules. The abbreviations U,C,G, and T correspond to the 5'-monophosphate forms of the 
nucleosides uracil, cytidine, guanine, and thymine, respectively when they occur in RNA molecules. In double stranded 
DNA, base pair may refer to a partnership of A with T or C with G. In a DNA/RNA heteroduplex, base pair may refer 
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IS 



35 



to a partnership of T with U or C with G. 

Obesity protein - refers to a protein of the Formula (I): 



(SEQ ID NO: 1) 
5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Xaa Ser Val Ser Ser 

35 40 -45 

Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro He 

50 55 60 



Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

20 65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

25 100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

30 1 30 1 35 1 40 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys (I) 



wherein: 



Xaa at position 28 is Gin or absent; 

said protein having at least one of the following substitutions: 
40 Gin at position 4 is replaced with Glu; 

Gin at position 7 is replaced with Glu; 

Asn at position 22 is replaced with Gin or Asp; 

Thr at position 27 is replaced with Ala; 

Xaa at position 28 Is replaced with Glu; 
45 Gin at position 34 is replaced with Glu; 

Met at position 54 is replaced with methionine sulfoxide, Leu, lie, Val, Ala, or Gly; 

Gin at position 56 is replaced with Glu; 

Gin at position 62 is replaced with Glu; 

Gin at position 63 is replaced with Glu; 
so Met at position 68 is replaced with methionine sulfoxide, Leu, lie, N^l, Ala, or Gly; 

Asn at position 72 is replaced with Gin, Glu, or Asp; 

Gin at position 75 is replaced with Glu; 

Ser at position 77 is replaced with Ala; 

Asn at position 78 is replaced with Gin or Asp; 
55 Asn at position 82 is replaced with Gin or Asp; 

His at position 97 is replaced with Gin, Asn, Ala, Gly, Ser, or Pro; 

Trp at position 100 is replaced with Ala, Glu, Asp, Asn, Met, lie, Phe, Tyr, Ser, Thr, Gly, Gin. Val or Leu; 
Ala at position 101 is replaced with Ser, Asn, Gly, His, Pro, Thr, or Val; 
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Ser at position 102 is replaced with Arg; 
Gly at position 1 03 is replaced witli Ala; 
Glu at position 105 is replaced with Gin; 
Thr at position 1 06 is replaced with Lys or Ser; 
Leu at position 107 is replaced with Pro; 
Asp at position 108 is replaced with Glu; 
Gly at position 111 is replaced with Asp; 
Gly at position 118 is replaced with Leu; 
Gin at position 130 is replaced with Glu; 
Gin at position 134 is replaced with Glu; 

Met at position 136 is replaced with methionine sulfoxide, Leu, lie, \^l, Ala, or Gly; 
Trp at position 138 is replaced with Ala, Glu. Asp, Asn, Met, lie. Phe, Tyr, Ser. Thr, Gly, Gin, Val or Leu; or 
Gin at position 1 39 is replaced with Glu; 

IS or a pharmaceutically acceptable salt thereof. 

Plasmid - an extrachromosomal self-replicating genetic element. 

Reading frame - the nucleotide sequence from which translation occurs "read" in triplets by the translational ap- 
paratus of tRNA, ribosomes and associated factors, each triplet corresponding to a particular amino acid. Because 
each triplet is distinct and of the same length, the coding sequence must be a multiple of three. A base pair insertion 

20 or deletion (termed a f rameshift mutation) may result in two different proteins being coded for by the same DN A segment. 
To insure against this, the triplet codons corresponding to the desired polypeptide must be aligned in multiples of three 
from the initiation codon, i.e. the correct "reading frame" must be maintained. In the creation of fusion proteins containing 
a chelating peptide, the reading frame of the DNA sequence encoding the structural protein must be maintained in the 
DNA sequence encoding the chelating peptide. 

2S Recombinant DNA Cloning Vector - any autonomously replicating agent including, but not limited to, plasmids 

and phages, comprising a DNA molecule to which one or more additional DNA segments can or have been added. 

Recombinant DNA Expression Vector ~ any recombinant DNA cloning vector in which a promoter has been incor- 
porated. 

Replicon - A DNA sequence that controls and allows for autonomous replication of a plasmid or other vector. 
30 Transcription - the process whereby information contained in a nucleotide sequence of DNA is transferred to a 

complementary RNA sequence. 

Translation -■ the process whereby the genetic information of messenger RNA is used to specify and direct the 
synthesis of a polypeptide chain. 

Vector - a replicon used for the transformation of cells in gene manipulation bearing polynucleotide sequences 
35 corresponding to appropriate protein molecules which, when combined with appropriate control sequences, confer 
specific properties on the host cell to be transformed. Plasmids, viruses, and bacteriophage are surtable vectors, since 
they are replicons in their own right. Artificial vectors are constructed by cutting and joining DNA molecules from different 
sources using restriction enzymes and ligases. Vectors include Recombinant DNA cloning vectors and Recombinant 
DNA expression vectors. 

40 Treating - as used herein, describes the management and care of a patient for the purpose of combating the 

disease, condition, or disorder and includes the administration of a protein of present invention to prevent the onset of 
the symptoms or complications, alleviating the symptoms or complications, or eliminating the disease, condition, or 
disorder. Treating as used herein includes the administration of the protein for cosmetic purposes. A cosmetic purpose 
seeks to control the weight of a mammal to improve bodily appearance. 

4S Isotonicity agent -- isotonicity agent refers to an agent that is physiologically tolerated and embarks a suitable 

tonicity to the formulation to prevent the net flow of water across the cell membrane. Compounds, such as glycerin, 
are commonly used for such purposes at known concentrations. Other possible isotonicity agents include salts, e.g., 
NaCI, dextrose, and lactose. 

Physiologically tolerated buffer - a physiologically tolerated buffer is known in the art. A physiologically tolerated 
50 buffer is preferably a phosphate buffer, like sodium phosphate. Other physiologically tolerated buffers include TRIS, 
sodium acetate, sodium citrate as well as conventional glycine and arginine based buffering systems. The selection 
and concentration of buffer is known in the art. 

The nucleotide and amino acid abbreviations are accepted by the United States Patent and Trademark Office as 
set forth in 37 C.F.R. § 1.822 (b) (2) (1993). One skilled in the art would recognize that certain amino acids are prone 
55 to rearrangement. For example. Asp may rearrange to aspartimide and isoasparigine as described in I. Schon et al.. 
Int. J. Peptide Protein Res. J4: 485-94 (1979) and references cited therein. These rearrangement derivatives are 
included within the scope of the present invention. Unless otherwise indicated the amino acids are in the L configuration. 

As noted above, the invention provides a soluble parenteral formulation, comprising a novel obesity protein and 
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a preservative selected from the group consisting of alkylparaben or chlorobutanol. In the presence of these preserv- 
atives, the presently claimed obesity proteins unexpectedly remain in solution making a soluble, parenteral formulation 
possible. 

A parenteral formulation must meet guidelines for preservative effectiveness to be a commercially viable product. 
5 Preservatives known in the art as being acceptable in parenteral formulations include: phenol, m-cresol, benzyl alcohol, 
methylparaben, chlorobutanol, p-cresol, phenylmercuric nitrate, thimerosal and various mixtures thereof. See, e.g., 
WALLHAUSER, K.-H., DEVELOP. BIOL STANDARD. 24, pp. 9-28 (Basel. S. Krager, 1974). 

Most unexpectedly, a select number of presen/atives have been identified that provide good formulation stability 
These select preservatives are alkylparaben or chlorobutanol. Most preferrably, the presen^ative is methylparaben, 
10 ethylparaben, propylparaben, or butylparaben. Most unexpectedly, the obesity protein does not aggregate or precipitate 
in the presence of these preservatives at the conditions necessary to formulate and particularly conditions at 37°C. 

The concentration of obesity protein in the formulation is about 1.0 mg/mL to about iOO mg/mL; preferably about 
5.0 mg/mL to about 50.0 mg/mL; most preferably, about 10.0 mg/mL. The concentration of preservative required is the 
concentration necessary to maintain preservative effectiveness. The relative amounts of preservative necessary to 
IS maintain preservative effectiveness varies with the preservative used. Generally, the amount necessary can be found 
in WALLHAUSER, K.-H., DEVELOP. BIOL STANDARD. 24, pp, 9-28 (Basel, S. Krager, 1974), herein incorporated by 
reference. The optimal concentration of the presen/ative depends on the preservative, its solubility, and the pH of the 
formulation. 

An isotonicity agent, preferably glycerin, may be additionally added to the formulation. The concentration of the . 

20 isotonicity agent is in the range known in the art for parenteral formulations, preferably about 16 mg/mL. The pH of the 
formulation may also be buffered with a physiologically tolerated buffer, preferably a phosphate buffer, like sodium 
phosphate. Other acceptable physiologically tolerated buffers include TRIS, sodium acetate, or sodium citrate. The 
selection and concentration of buffer is known in the art; however, the formulations of the present invention are pref- 
erably prepared in the absence of salt so that the ionic strength of the formulation is minimized. 

2S Also included as optional embodiments are agents known to be synergistic with the preservative to provide en- 

hanced antimicrobial effects. Such agents are recognized in the art and include for example EDTA, EGTA, citrate, and 
caprylic acid. The concentration of these agents varies with the desired preservation effect. A preferred agent is EDTA, 
particularly in conjunction with an alkylparabens at a concentration of about 0.025% to 0.4%. 

The present formulations may optionally contain a physiologically tolerated solvent such as glycerol, propylene, 

30 glycol, phenoxy ethanol, phenyl ethyl alcohol. Such solvents are generally added to enhance the solubility of the protein 
in the presentation effectiveness of the formulation. 

Other additives, such as a pharmaceutically acceptable solubllizers like Tween 20 (polyoxyethylene (20) sorbitan 
monolaurate), Tween 40 (polyoxyethylene (20) sorbitan monopalmitate), Tween 80 (polyoxyethylene (20) sorbitan 
monooleate), Pluronic F68 (polyoxyethylene polyoxypropylene block copolymers), BRI J 35 (polyoxyethylene (23) lauryl 

35 ether), and PEG (polyethylene glycol) may optionally be added to the formulation to reduce aggregation. These addi- 
tives are particularly useful if a pump or plastic container is used to administer the formulation. The presence of phar- 
maceutically acceptable surfactant mitigates the propensity for the protein to aggregate. 

The parenteral formulations of the present invention can be prepared using conventional dissolution and mixing 
procedures. To prepare a suitable formulation, for example, a measured amount of human obesity protein in water is 

40 combined with the desired preservative in water in quantities sufficient to provide the protein and preservative at the 
desired concentration. The formulation is generally sterile filtered prior to administration. Variations of this process 
would be recognized by one of ordinary skill in the art. For example, the order the components are added, the surfactant 
used, the temperature and pH at which the formulation is prepared may be optimized for the concentration and means 
of administration used. 

45 Preferably, the pH of the present formulations is about pH 7.0 to about 9.0 and, most preferably 7.6 to 8.5. The 

formulations are preferably prepared under basic conditions by mixing the obesity protein and preservative at a pH 
greater than pH 7.0. Preferably, the pH is about 7.6 to 8.5, and most preferably about pH 7.8. Ideally, a presen/ative 
and water solution is mixed at pH 7.6 to 8.5. Added to this solution is obesity protein in water The pH is adjusted, as 
necessary, to about pH 7.6 to 8.5. The solution is then held until the components are dissolved, approximately 20 to 

SO- 40 minutes, preferably about 30 minutes. The base used to adjust the pH of the formulation may be one or more 
pharmaceutically acceptable bases such as sodium hydroxide or potassium hydroxide. The preferred base is sodium 
hydroxide. 

The formulations prepared in accordance with the present invention may be used in a syringe, injector, pumps or 
any other device recognized in the art for parenteral administration. 
55 Preferred proteins of the present invention are those of Formula I, wherein: 

Gin at position 4 is replaced with Glu; 
Gin at position 7 is replaced with Glu; 
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Asn at position 22 is replaced with Gin or Asp; 

Thr at position 27 is replaced with Ala; 

Gin at position 28 is replaced with Glu; 

Gin at position 34 is replaced with Glu; 
s Met at position 54 is replaced with nnethionine sulfoxide. Leu, or Ala; 

Gin at position 56 is replaced with Glu; 

Gin at position 62 is replaced with Glu; 

Gin at position 63 is replaced with Glu; 

Met at position 68 is replaced with methionine sulfoxide, or Leu; 
10 Asn at position 72 is replaced with Gin or Asp; 

Gin at position 75 is replaced with Glu; 

Asn at position 78 is replaced with Gin or Asp; 

Asn at position 82 is replaced with Gin or Asp; 

Gin at position 130 is replaced with Glu; 
IS Gin at position 134 is replaced with Glu; 

Met at position 136 is replaced with methionine sulfoxide, Leu, lie; or 

Gin at position 139 is replaced with Glu. 

Other preferred proteins are those of Formula I wherein: 

20 

Asn at position 22 is replaced with Gin or Asp; 
Thr at position 27 is replaced with Ala; 

Met at position 54 is replaced with methionine sulfoxide, Leu, or Ala; 
Met at position 68 is replaced with methionine sulfoxide, or Leu; 
25 Asn at position 72 is replaced with Gin or Asp; 

Asn at position 78 Is replaced with Gin or Asp; 

Asn at position 82 is replaced with Gin or Asp; or 

Met at position 136 Is replaced with methionine sulfoxide, Leu. or He. 

30 Still yet additional preferred proteins are those of Formula I, wherein: 

Asn at position 22 is replaced with Gin or Asp; 
Thr at position 27 Is replaced with Ala; 
Met at position 54 is replaced with Leu, or Ala; 
3S Met at position 68 is replaced with Leu; 

Asn at position 72 is replaced with Gin or Asp; 
Asn at position 78 is replaced with Gin or Asp; 
Asn at position 82 is replaced with Gin or Asp; or 
Met at position 1 36 Is replaced with Leu, or lie. 

40 

Preferred species within Formula I Include species of SEQ ID NO: 2 and SEO ID NO: 3: 
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25 



(SEQ ID NO: 2) 
5 10 15 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys Thr 

20 25 30 ■ 

He Val Thr Arg He Asp Asp He Ser His Thr Gin Ser Val Ser Ser 

.35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin lie 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys 



30 



35 



40 



(SEQ ID NO: 3) 
5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Ala Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 



45 



85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 



115 

Val Leu Glu Ala 
130 

Leu Gin Gly Ser 

145 

Gly Cys 



120 

Ser Gly Tyr Ser 
135 

Leu Gin Asp Met 



Thr Glu Val Val 
140 

Leu Trp Gin Leu 



125 

Ala Leu Ser Arg 
Asp Leu Ser Pro 



7 



EP 0 835 879 A2 



10 



15 



Most significantly, other preferred proteins of the present formulations are specific substitutions to arhino acid 
residues 97 to 111, and/or 138 of the proteins of SEQ ID NO: 1. These substitutions result in additional stability and 
are superior therapeutic agents. These specific proteins are more readily formulated and are more pharmaceutically 
elegant, which results in superior delivery of therapeutic doses. Accordingly, preferred embodiments are formulation 
comprising proteins of the Formula II: 

(SEQ ID NO: 4) 
5 10 15 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys Thr 

20 25 • 30 . 

lie Val Thr Arg He Asn Asp He Ser His Thr Xaa Ser Val Ser Ser . 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

30 Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys (II) 



20 



25 



35 wherein: 

Xaa at position 28 is Gin or absent; 
said protein having at least one substitution selected from the group consisting of: 

His at position 97 is replaced with Gin, Asn, Ala, Gly, Ser, or Pro; 
40 Trp at position 100 is replaced with Ala, Glu, Asp, Asn, Met, lie, Phe, Tyr, Ser, Thr, Giy, Gin, Val or Leu; 

Ala at position 101 is replaced with Ser, Asn, Gly, His, Pro, Thr, or Val; 

Ser at position 102 is replaced with Arg; 

Gly at position 103 is replaced with Ala; 

Glu at position 105 is replaced with Gin; 
45 Thr at position 106 is replaced with Lys or Ser; 

Leu at position 107 is replaced with Pro; 

Asp at position 108 Is replaced with Glu; 

Gly at position 111 is replaced with Asp; or 

Trp at position 138 is replaced with Ala, Glu, Asp, Asn, Met, lie. Phe, Tyr, Ser, Thr. Gly, Gin, Val or Leu; 

50 

or a pharmaceutically acceptable salt thereof. 

Preferred proteins are a protein of the Formula III: 



55 
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{SEQ ID NO: 5) 

5 10 15 

5 Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

^0 Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

IS Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

20 His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 



2S 



30 



130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys (III) 



said protein having at least one substitution selected from the group consisting of: 



His at position 97 is replaced with Gin, Asn, Ala, Gly, Ser, or Pro; 

Trp at position 100 is replaced with Ala, Glu, Asp, Asn, Met, lie, Phe, Tyr, Ser, Thr. Gly, Gin, Val or Leu; 
35 Ala at position 101 is replaced with Ser, Asn, Gly, His, Pro, Thr, or Val; 

Ser at position 102 is replaced with Arg; 

Gly at position 103 is replaced with Ala; 

Glu at position 105 is replaced with Gin; 

Thr at position 106 is replaced with Lys or Ser; 
40 Leu at position 107 is replaced with Pro; 

Asp at position 108 is replaced with Glu; 

Gly at position 1 1 1 is replaced with Asp; or 

Trp at position 138 is replaced with Ala, Glu, Asp, Asn, Met, lie. Phe. Tyr, Ser. Thr, Gly, Gin, Val or Leu; 

45 or a pharmaceutically acceptable salt thereof. 

More preferred proteins of the Formula III are those wherein: 

His at position 97 is replaced with Gin, Asn, Ala, Gly, Ser or Pro; 

Trp at position 100 is replaced with Ala, Glu, Asp, Asn, Met. He, Phe, Tyr, Ser. Thr, Gly, Gin or Leu; 
so Ala at position 101 is replaced with Ser, Asn, Gly, His, Pro, Thr or Val; 

Glu at position 105 is replaced with Gin; 

Thr at position 106 is replaced with Lys or Ser; 

Leu at position 107 is replaced with Pro; 

Asp at position 108 is replaced with Glu; 
55 Gly at position 111 is replaced with Asp; or 

Trp at position 138 is replaced with Ala. Glu, Asp, Asn, Met. lie. Phe, Tyr, Ser, Thr, Gly, Gin, Val or Leu. 

Other preferred proteins of the Formula III are those wherein: 
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His at position 97 is replaced with Ser or Pro; 

Trp at position 100 is replaced with Ala, Gly, Gin, Vat, He, or Leu; 

Ala at position 101 is replaced with Thr; or 

Trp at position 1 38 is replaced with Ala. lie. Gly, Gin, Val or Leu. 

5 

Yet still additional preferred proteins of the Formula III are those wherein: 

His at position 97 is replaced with Ser or Pro; 
Trp at positbn 100 is replaced with Ala, G!n or Leu; 
10 Ala at position 101 is replaced with Thr; or 

Trp at position 1 38 is replaced with Gin. 

Most preferred species of the present invention are those proteins having a dl-sulfide bond between Cysat position 
96 and Cys at position 146. Examples of most preferred species include species of SEQ ID NO:^ 6-13: 

IS 

(SEQ ID NO: 6) 
5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

25 50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 



30 



35 



40 



45 



85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 



Gly Cys 

said protein having a di-sulfide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
salts thereof. 



so 



55 
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10 



(SEQ ID NO: 7) 
5 .10 15 

Val Pro He Gin Lys Val Gin Asp. Asp Thr Lys Thr Leu lie Lys Thr 

20 25 30 • 

lie Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val S^r Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

20 His Leu Pro Gin Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 



75 



25 



130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys . 

30 said protein having a di-sulf ide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
salts thereof. 

(SEQ ID NO: 8) 

35 5 . 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

40 3 5 4 0 4 5 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin lie 

^65 70 75 80 ■ 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

50 



55 
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Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 

145 

Gly Cys 

IS said protein having a dl-sutfide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
salts thereof. 



10 



20 



25 



30 



(SEQ ID NO: 9) 
5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

35 100 105 110 

His Leu Pro Gin Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

^0 130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 

145 

Gly Cys 



45 



SO 



55 



said protein having a di-sulf ide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
salts thereof. 



(SEQ ID NO: 10) 
5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Ala Gin Ser Val Ser Ser 

35 40 45 
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Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 . 60 

5 Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

10 Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

. 100 105 110 

His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

75 Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu S6r Pro 

145 

20 Gly Cys 

said protein having a di-sulfide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
salts thereof . 

25 

(SEQ ID NO: 11) 
5 10 15 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin lie 

65 70 . 75 . 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

40 85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly. Gly 

45 115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 

50 145 

Gly Cys 

said protein having a di-sulfide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
salts thereof. 

55 
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10 



IS 



20 



(SEQ ID NO: 12) 
5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu. Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

Ser Leu Pro Gin Thr Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 

145 

Gly Cys 

said protein having a di-sutfide bond between Cys at position 96 and Cys at position 1 46; or pharmaceutically acceptable 
30 salts thereof. 



(SEQ ID NO: 13) 
5 10 15 

35 Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 .40 45 

4^ Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 



2S 



45 



50 



65 70 75 80 

Leu Thr Ser Met Pro S$r Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

Ser Leu Pro Gin Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 



55 
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130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 

145 

5 Gly Cys 



said protein having a di-sulf ide bond between Cys at position 96 and Cys at position 1 46; or pharnnaceutically acceptable 
salts thereof. 

The obesity proteins of the present invention can be prepared by any of a variety of recognized peptide synthesis 
10 techniques including classical (solution) methods, solid phase methods, semi synthetic methods, and more recent 
recombinant DNA methods. Recombinant methods are preferred if a high yield is desired. The basic steps in the 
recombinant production of protein include: 

a) construction of a synthetic or semi-synthetic (or isolation from natural sources) DNA encoding the obesity protein, 
IS b) integrating the coding sequence into an expression vector in a manner suitable for the expression of the protein 

either alone or as a fusion protein, 

c) transforming an appropriate eukaryotic or prokaryotic host cell w|th the expression vector, and 

d) recovering and purifying the recombinantly produced protein. 

20 Synthetic genes, the jn vitro or in vivo transcription and translation of which will result in the production of the 

protein may be constructed by techniques well known in the art. Owing to the natural degeneracy of the genetic code, 
the skilled artisan will recognize that a sizable yet definite number of DNA sequences may be constructed which encode 
the desired proteins, in the preferred practice of the invention, synthesis is achieved by recombinant DNA technology 
Methodology of synthetic gene construction is well known in theart. For example, see Brown, et al. (1979) Methods 

2S in Enzymology, Academic Press, N.Y., Vol. 68^ pgs. 109-151. The DNA sequence corresponding to the synthetic 
claimed protein gene may be generated using conventional DNA synthesizing apparatus such as the Applied Biosys- 
tems Model 380Aor 380B DNA synthesizers (commercially available from Applied Biosystems, Inc.. 850 Lincoln Center 
Drive, Foster City, CA 94404). It may be desirable in some applications to modify the coding sequence of the obesity 
protein so as to incorporate a convenient protease sensitive cleavage site, e.g., between the signal peptide and the 

30 structural protein facilitating the controlled excision of the signal peptide from the fusion protein construct. 

The gene encoding the obesity protein may also be created by using polymerase chain reaction (PCR). The tem- 
plate can be a cDNA library (commercially available from CLONETECH or STRATAGENE) or mPNA Isolated from the 
desired arrival adipose tissue. Such methodologies are well known in the art Maniatis, et aj. Molecular Cloning: A 
Laboratorv Manual. Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989). 

36 The constructed or isolated DNA sequences are useful for expressing the obesity protein either by direct expression 

or as fusion protein. When the sequences are used in a fusion gene, the resulting product wilt require enzymatic or 
chemical cleavage. A variety of peptidases which cleave a polypeptide at specific sites or digest the peptides from the 
amino or carboxy termini (e.g. diaminopeptidase) of the peptide chain are known. Furthermore, particular chemicals 
(e.g. cyanogen bromide) will cleave a polypeptide chain at specific sites. The skilled artisan will appreciate the modi- 

40 fications necessary to the amino acid sequence (and synthetic or semi-synthetic coding sequence if recombinant means 
are employed) to incorporate site-specific internal cleavage sites. See U.S. Patent No. 5,126,249; Carter P., Site Spe- 
cific Proteolysis of Fusion Proteins, Ch. 13 in Protein Purification: From Molecular Mechanisms to Large Scale Proc- 
esses. American Chemical Soc, Washington, D.C. (1990). 

Construction of suitable vectors containing the desired coding and control sequences employ standard ligation 

45 techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the form desired to form the 
plasmids required. 

To effect the translation of the desired protein, one inserts the engineered synthetic DNA sequence In any of a 
plethora of appropriate recombinant DNA expression vectors through the use of appropriate restriction endonucleases. 
A synthetic coding sequence may be designed to possess restriction endonuclease cleavage sites at either end of the 

so transcript to facilitate isolation from and Integration Into these expression and amplification and expression plasmids. 
The isolated cDN A coding sequence may be readily modified by the use of synthetic linkers to facilitate the incorporation 
of this sequence Into the desired cloning vectors by techniques well known in the art. The particular endonucleases 
employed will be dictated by the restriction endonuclease cleavage pattern of the parent expression vector to be em- 
ployed. The restriction sites are chosen so as to properly orient the coding sequence with control sequences; to achieve 

ss proper in-frame reading and expression of the protein. 

In general, plasmid vectors containing promoters and control sequences which are derived from species compatible 
with the host cell are used with these hosts. The vector ordinarily carries a replication origin as well as marker sequences 
which are capable of providing phenotypic selection in transformed celts. For example. E. coli is typically transformed 
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using pBR322, a plasmid derived from an E. coN species (Bolivar, et aj., Gene 2: 95 (1 977)). Plasmid pBR322 contains 
genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells. The 
pBR322 plasmid, or other microbial plasmid must also contain or be modified to contain promoters and other control 
elements commonly used in recombinant DNA technology. 

s The desired coding sequence is inserted into an expression vector in the proper orientation to be transcribed from 

a promoter and ribosome binding site, both of which should be functional in the host cell in which the protein is to be 
expressed. An example of such an expression vector is a plasmid described in Belagaje et al., U.S. patent No. 
5,304,493, the teachings of which are herein incorporated by reference. The gene encoding A-C-B proinsulin described 
in U.S. patent No. 5,304,493 can be removed from the plasmid pRB182 with restriction enzymes Ndel and Bam HI. 

10 The isolated DNA sequences can be inserted into the plasmid backbone on a Ndel/BamHI restriction fragment cassette. 

In general, prokaryotes are used for cloning of DNA sequences in constructing the vectors useful in the invention. 
For example, E. coli K1 2 strain 294 (ATCC No. 31446) is particularly useful. Other microbial strains which may be used 
include Ecoli B and E. coH XI 776 (ATCC No. 31537). These examples are illustrative rather than limiting. 

Prokaryotes also are used for expression. The aforementioned strains, as well as^ coji W3110 (prototrophic, 

IS ATCC No. 27325), bacilli such as Bacillus subtilis. and other enterobacterlaceae such as Salmonella typhimurium or 
Serratia marcescans. and various pseudomonas species may be used. Promoters suitable for use with prokaryotic 
hosts include the P-lactamase (vector pGX2907 [ATCC 39344] contains the replicon and p-lactamase gene) and lactose 
promoter systems (Chang et al.. Nature , 275 :615 (1978); and Goeddel et al., Nature 281 :544 (1979)), alkaline phos- 
phatase, the tryptophan (trp) promoter system (vector pATHI [ATCC 37695] is designed to facilitate expression of an 

20 open reading frame as a trpE fusion protein under control of the trp promoter) and hybrid promoters such as the tac 
promoter (isolatable from plasmid pDR540 ATCC-37282). However, other functional bacterial promoters, whose nu- 
cleotide sequences are generally known, enable one of skill in the art to ligate them to DNA encoding the protein using 
linkers or adaptors to supply any required restriction sites. Promoters for use in bacterial systems also will contain a 
Shine-Dalgarno sequence operably linked to the DNA encoding protein. 

26 The DNA molecules may also be recombinantly produced in eukaryotic expression systems. Preferred promoters 

controlling transcription In mammalian host cells may be obtained from various sources, for example, the genomes of 
viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably 
cytomegalovirus, or from heterologous mammalian promoters, e.g. p-actin promoter. The early and late pronrraters of 
the SV40 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral origin of 

30 replication. Fiers, et aj.. Nature , 273:113 (1978). The entire SV40 genome may be obtained from plasmid pBRSV. 
ATCC 45019. The immediate early promoter of the human cytomegalovirus may be obtained from plasmid pCfy^Bb 
(ATCC 77177). Of course, promoters from the host cell or related species also are useful herein. 

Transcription of the DNA by higher eukaryotes is increased by inserting an enhancer sequence into the vector. 
Enhancers are cis-acting elements of DNA, usually about 1 0-300 bp, that act on a promoter to increase its transcription. 

35 Enhancers are relatively oriented and positioned independently and have been found 5' (Laimins, L. jgt al., PNAS 78 : 
993 (1981)) and 3' (Lusky, M. L., et al., Mol. Cell Bio. 3:1108 (1983)) to the transcription unit, within an intron (Banerji. 
J. L. etal.. Cell 33:729 (1983)) as well as within the coding sequence itself (Osborne, T R. etal.. Mol. Ceil Bio. 4:1293 
(1984)). Many enhancer sequences are now known from mammalian genes (globin, RSV, SV40, EMC, elastase, al- 
bumin, alpha-fetoprotein and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus, Exam- 

40 pies include the SV40 late enhancer, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated cells from 
other multicellular organisms) will also contain sequences necessary for the termination of transcription which may 
affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the 

45 mRNA encoding protein. The 3' untranslated regions also include transcription termination sites. 

Expression vectors may contain a selection gene, also termed a selectable marker. Examples of suitable selectable 
markers for mammalian cells are dihydrofolate reductase (DHFR, which may be derived from the BgjII/Hindll I restriction 
fragment of pJOD-10 [ATCC 68815]), thymidine kinase (herpes simplex virus thymidine kinase is contained on the 
Bam HI fragment of vP-5 clone [ATCC 2028]) or neomycin (G418) resistance genes (obtainable from pNN414 yeast 

so artificial chromosome vector [ATCC 37682]). When such selectable markers are successfully transferred into a mam- 
malian host cell, the transfected mammalian host cell can survive if placed under selective pressure. There are two 
widely used distinct categories of selective regimes. The first category is based on a cell's metabolism and the use of 
a mutant cell line which lacks the ability to grow without a supplemented media. Two examples are: CHO DHFR" cells 
(ATCC CRL-9096) and mouse LTK' cells (L-M(TK-) ATCC CCL-2.3). These cells lack the ability to grow without the 

ss addition of such nutrients as thymidine or hypoxanthine. Because these cells lack certain genes necessary for a com- 
plete nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are provided in a supplemented 
media. An alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the 
respective genes, thus altering their growth requirements. Individual cells which were not transformed with the DHFR 
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or TK gene will not be capable of survival in nonsupplemented media. 

The second category is dominant selection which refers to a selection scheme used In any cell type and does not 
require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells 
which have a novel gene would express a protein conveying drug resistance and would survive the selection. Examples 

s of such dominant selection use the drugs neomycin, Southern R and Berg, R, J. Molec. Appl. Genet. J_: 327 (1982), 
mycophenollc acid, Mulligan, R. C. and Berg, R Science 209 :1422 (1980), or hygromycin, Sugden, B. et al.. Mol Cell. 
Biol. 5:410-413 (1985). The three examples given above employ bacterial genes under eukaryotic control to convey 
resistance to the appropriate drug G41 8 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively 
A preferred vector for eukaryotic expression is pRc/CMV. pRc/CMV is commercially available from Invitrogen Cor- 

10 poration. 3985 Sorrento Valley Blvd., San Diego, CA 92121. To confirm correct sequences in plasmids constructed, 
the ligation mixtures are used to transform E. coll K12 strain DH10B (ATCC 31446) and successful transformants 
selected by antibiotic resistance where appropriate. Plasmids from the transformants are prepared, analyzed by re- 
striction and/or sequence by the method of Messing, et al.. Nucleic Acids Res. 9:309 (1981). 

Host cells may be transformed with the expression vectors of this invention and cultured in conventional nutrient 

IS media modified as is appropriate for inducing promoters, selecting transformants or amplifying genes. The culture 
conditions, such as temperature. pH and the like, are those previously used with the host cell selected for expression, 
and will be apparent to the ordinarily skilled artisan. The techniques of transforming cells with the. aforementioned 
vectors are well known in the art and may be found in such general references as Maniatis, et al.. Molecular Cloning: 
A Laboratory Manual , Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989), 

20 or Current Protocols in Molecular Biology (1 989) and supplements. 

Preferred suitable host cells for expressing the vectors encoding the claimed proteins in higher eukaryotes include: 
African green monkey kidney line cell line transformed by SV40 (COS-7, ATCC CRL-1 651 ); transformed human primary 
embryonal kidney cell line 293, (Graham, F. L. etal.. J. Gen Virol. 36:59-72 (1977). Virology 77:319-329. Virology 86: 
10-21); baby hamster kidney cells (BHK-21(C-13), ATCC CCL-10. Virology 16:147 (1962)); Chinese hamster ovary 

25 cells CHO-DHFR' (ATCC CRL-9096), mouse Sertoli cells (TM4, ATCC CRL-1715. Biol. Reprod. 23:243-250 (1980)); 
African green monkey kidney cells (VERO 76, ATCC CRL-1 587); human cervical epitheloid carcinoma cells (HeLa, 
ATCC CCL-2); canine kidney cells (MDCK, ATCC CCL-34); buffalo rat liver cells (BRL 3A, ATCC CRL-1 442); human 
diploid lung cells (WI-38, ATCC CCL-75); human hepatocellular carcinoma cells (Hep G2, ATCC HB-8065);and mouse 
mammary tumor cells (MMT 060562, ATCC CCL51 ). 

30 In addition to prokaryotes, unicellular eukaryotes such as yeast cultures may also be used. Saccharomyces cer- 

evisiae. or common baker's yeast is the most commonly used eukaryotic microorganism, although a.number of other 
strains are commonly available. For expression in Saccharomyces, the plasmid YRp7, for example, (ATCC-40053, 
Stinchcomb, et al.. Nature 282:39 (1 979); Kingsman et al.. Gene 7: 1 41 (1 979); Tschemper et aj. , Gene 10: 1 57 (1 980)) 
is commonly used. This plasmid already contains the trp gene which provides a selection marker for a mutant strain 

35 of yeast lacking the ability to grow in tryptophan, for example ATCC no. 44076 or PEP4-1 (Jones, Genetics 85:12 
(1977)). 

Suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase (found 
on plasmid pAP12BD ATCC 53231 and described in U.S. Patent No. 4,935,350, June 19, 1990) or other glycolytic 
enzymes such as enolase (found on plasmid pACl ATCC 39532), glyceraldehyde-3-phosphate dehydrogenase (derived 

40 from plasmid pHcGAPCI ATCC 57090, 57091 ), zymomonas mobilis (United States Patent No. 5,000,000 issued March 
19, 1991), hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phos- 
phoglycerate mutase, pyruvate kinase, triosephosphate Isomerase, phosphoglucose Isomerase, and glucokinase. 

Other yeast promoters, which contain inducible promoters having the additional advantage of transcription con- 
trolled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phos- 

45 phatase, degradative enzymes associated with nitrogen metabolism, metallothionein (contained on plasmid vector 
pCL28XhoLHBPV ATCC 39475, United States Patent No. 4,840,896), glyceraldehyde 3-phosphate dehydrogenase, 
and enzymes responsible for maltose and galactose (G AL1 found on plasmid pRyl21 ATCC 37658) utilization. Suitable 
vectors and promoters for use in yeast expression are further described In R. Hitzeman et a!., European Patent Pub- 
lication No. 73.657 A. Yeast enhancers such as the UAS Gal from Saccharomyces cerevisiae (found in conjunction 

so with the CYC1 promoter on plasmid YEpsec-hllbeta ATCC 67024), also are advantageously used with yeast promot- 
ers. 

The following examples and preparations are provided merely to further illustrate the preparation of the formulations 
of the invention. The scope of the invention is not construed as merely consisting of the following examples. 

55 
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Example 1 

Obesity Protein Formulation 

s To generate a solution of a protein ot SEQ ID NO: 6 (hereinafter, Protein NO: 6) lyophilized solid material is first 

dissolved In water to generate a stock solution (stock 1). The concentration of Protein NO: 6 in stock 1 is verified by 
UV/VIs spectrophometry using the known extinction coefficient for Protein NO: 6 at the maximum spectral absorbance 
(279nm or 280nm), measuring the maximum spectral absorbance (279nm or 280nm), and utilizing the dilution factor 
A stock presen/ative solution containing methylparaben, for example, is prepared by dissolving the solid in water (stock 

10 2). A solution of Protein NO: 6 at 1 .6 mg/mL is prepared by addition of an aliquot of stock 1 to a container which held 
an aliquot of stock 2 along with the required quantity of water, at an alkaline pH (7.8±0.3); adjusted if necessary with 
HCI or NaOH. After an appropriate incubation period at room temperature (30 minutes), the solution pH is examined 
and adjusted if necessary with trace quantities of HCI or NaOH, to yield Protein NO: 6 solution at 1 .6 mg/mL with 
0.17 % methylparaben pH 7.8±0.1. The solution is then hand-filtered using a glass syringe with an attached 0.22^m 

IS syringe filter into a glass vial. 

Example 2 

Obesity Protein Formulation 

20 

To generate a solution of a protein of SEQ ID NO: 6 (hereinafter Protein NO: 6), lyophilized solid material is first 
dissolved in water to generate a stock solution (stock 1). The concentration of Protein NO: 6 in stock 1 is verified by 
U V/Vis Spectrophometry using the known extinction coefficient for Protein NO: 6 at the maximum spectral absorbance 
(279nm or 280nm), measuring the maximum spectral absorbance (279nm or 280nm), and utilizing the dilution factor 

25 A stock preservative solution containing chlorobutanol, for example, is prepared by dissolving the solid in water (stock 
2). A solution of Protein NO: 6 at 1 .6 mg/mL is prepared by addition of an aliquot of stock 1 to a container which held 
an aliquot of stock 2 along with the required quantity of water, at an alkaline pH (7.840.3); adjusted if necessary with 
HCI or NaOH. After an appropriate incubation period at room temperature (30 minutes), the solution pH is examined 
and adjusted if necessary with trace ^iL quantities of HCI or NaOH, to yield an Protein NO: 6 solution at 1.6 mg/mL 

30 with 0.50% chlorobutanol pH 7,8±0.1 . The solution is then hand-filtered using a glass syringe with an attached 0.22|im 
syringe filter into a glass vial. 

Example 3 

35 Obesity Protein Formulation 

To generate a solution of a protein of SEQ ID NO: 6 (hereinafter Protein NO: 6), solid bulk material, lyophilized 
from a neutral water solution, is redissolved in water to generate a stock solution (stock 1 ). The concentration of Protein 
NO: 6 in stock 1 is verified by UV/Vis Spectrophometry by multiplying the maximum spectral absorbance (279nm or 

40 280nm) by the dilution factor divided by the known extinction coefficient for Protein NO: 6. A stock preservative solution 
containing methylparaben is prepared by dissolving the solid in water (stock 2). A stock of an isotonicity agent, such 
as glycerin, is prepared by dissolving the neat liquid in water (stock 3). A stock of a physiologically-tolerated buffer, 
such as sodium phosphate is prepared by dissolving the solid in water (stock 4). A solution of Protein NO: 6 at 1 .6 mg/ 
mL is prepared by addition of an aliquot of stock 1 to a container which held an aliquot of stock 2 along with an aliquot 

45 of stock 3 along with the required quantity of water, adjusted if necessary with HCI or NaOH to an alkaline pH (7.6±0,3). 
After an appropriate incubation period at room temperature (30 minutes) an aliquot of stock 4 Is added. The solution 
pH is then readjusted if necessary with trace ^iL quantities of HCI or NaOH, to yield an Protein NO; 6 solution,.for 
example, at 1.6 mg/mL with 0.17% methylparaben, 16 mg/mL glycerin, and 14mM sodium phosphate at pH 7.8±0.1. 
The solution is then hand-filtered using a glass syringe with an attached 0.22jim syringe filter into a glass vial. 

50 

Example 4 

Obesity Protein Formulation 

55 To generate a solution of Human Obesity Protein mimetic (Protein NO: 6), solid bulk material, lyophilized from a 

neutral water solution, is redissolved in water to generate a stock solution (stock 1). The concentration of Protein NO: 
6 in stock 1 is verified by UV/Vis Spectrophometry by multiplying the maximum spectral absorbance (279nm or 280nm) 
by the dilution factor divided by the known extinction coefficient for Protein NO: 6. A stock preservative solution con- 
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taining chlorobutanol, for example, is prepared by dissolving the solid in water (stock 2). A stock of an isotonicity agent, 
such as glycerin, is prepared by dissolving the neat liquid in water (stock 3). A stock of a physiologically-tolerated 
buffer, such as sodium phosphate is prepared by dissolving the solid in water (stock 4). A solution of Protein NO: 6 at 
1.6 mg/mL is prepared by addition of an aliquot of stock 1 to a container which held an aliquot of stock 2 along with 

s an aliquot of stock 3 along with the required quantity of water, adjusted if necessary with HCI or NaOH to an alkaline 
pH (7.8±0.3). After an appropriate incubation period at room temperature (30 minutes) an aliquot of stock 4 is added. 
The solution pH is then readjusted if necessary with trace \iL quantities of HCI or NaOH, to yield an Protein NO: 6 
solution,. for example, at 1 .6 mg/mL with 0.5% chlorobutanol, 1 6 mg/mL glycerin, and 14mM sodium phosphate at pH 
7.8±0.1 . The solution is then hand-filtered using a glass syringe with an attached 0.22um syringe filter into a glass vial. 

10 The principles, preferred embodiments and modes of operation of the present invention have been described in 

the foregoing specification. The invention which is intended to be protected herein, however, is not to be construed as 
limited to the particular forms disclosed, since they are to be regarded as illustrative rather than restrictive. Nferiations 
and changes may be made by those skilled in the art without departing from the spirit of the inventbn. 

75 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(E) COUNTRY: United States of America 
'^0 (F) ZIP: 46285 

(ii) TITLE OF INVENTION: Therapeutic Proteins 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: A. M. Denholm 

(B) STREET: Erl Wood Manor 

(C) CITY: Windlesham 

(D) STATE: Surrey 

20 (E) COUNTRY: United Kingdom 

(F) 'ZIP: GU20 6PH 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 97308003.9 

(B) FILING DATE: 9th October 1997 



(2) INFORMATION FOR SEQ ID NO:l: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

15 10 15 



He Val Thr Arg He Asn Asp He Ser His Thr Xaa Ser Val Ser Ser 
50 20 25 30 



Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 
35 40 45 



Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 
50 55 60 
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10 



IS 



Leu Thr Ser Met Pro Ser Arg Asn Met lie Gin lie Ser Asn Asp Leu 
65 70 75 80 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 140 

Gly Cys 
145 

(2) INFORMATION FOR SEQ ID NO: 2: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

30 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 
15 10 15 

He Val Thr Arg He Asp Asp He Ser His Thr Gin Ser Val Ser Ser 
35 2 0 2 5 3 0 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu 'His Pro lie 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 
40 50 55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 
65 70 75 80 

^ Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

85 90 95 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

SO Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

115 120 125 

Leu Gin Gly Ser Leu^Gln Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 140 

55 Gly Cys 
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10 



20 



30 



35 



40 



45 



50 



145 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



^5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys Thr 

1 • • 5 10 ■ 15 



lie Val Thr Arg lie Asn Asp lie Ser His Ala Gin Ser Val Ser Ser 

20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro lie 
.35 40 45 

25 Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin lie 

50 .55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met lie Gin lie Ser Asn Asp Leu 
65 . 70 75 80 



55 



Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 .135 140 

Gly Cys 
145 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

1 5 .10 15 

He Val Thr Arg He Asn Asp He Ser His Thr Xaa Ser Val Ser Ser 
20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

50 55 ' 60 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 
65 70 75 80 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 140 

Gly Cys 
145 

INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 
1 5 10 15 . 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 
20 25 30 • 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 
50 55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

65 70 75 80 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 
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His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

g Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val- Val Ala Leu Ser Arg 

115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 140 

10 Gly Cys 

145 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 



IS 



2S 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 
1 5 10 15 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 
20 2.5 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

50 55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 
65 - 70 75 80 

Glu Asn Leu Arg Asp Leu. Leu His Val Leu Ala Phe Ser Lys Ser Cys 
40 85 90 95 

His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

100 105 110 



35 



4S 



Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 . 140 

so Gly Cys 

145 

(2) INFORMATION FOR SEQ ID^N0:7: 

(i) SEQUENCE CHARACTERISTICS: 
5^ (A) LENGTH: 146 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys Thr 
1 5 10 15 



15 



20 



lie Val Thr Arg lie Asn Asp lie Ser His Thr Gin Ser Val Ser Ser 
20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro lie 

35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin lie 
50 55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

65 70 75 80 

25 Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

85 90 95 

His Leu Pro Gin Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 



30 



35 



Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

115 120 .125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 140 

Gly Cys 
145 



(2) INFORMATION FOR SEQ ID NO: 8: 

40 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 



gQ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 
15 10 15 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 
55 20 25 30 
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15 



20 



30 



35 



50 



Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro lie 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin lie 
50 55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met lie Gin He Ser Asn Asp Leu 
65 70 75 80 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 . 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 
130 135 140 

Gly Cys 
145 



(2) INFORMATION FOR SEQ ID NO: 9: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 
1 5 . 10 , 15 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 
40 20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 
35 40 45 . 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 
45 50 55 60 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 
65 70 75 80 



55 



Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 

His Leu Pro Gin Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 
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Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 
130 135 140 

5 Gly Cys 

145 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

^5 (ii) MOLECULE TYPE: protein 



20 



2S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys Thr 

1 5 ■ 10 15 

He Val Thr Arg He Asn Asp He Ser His Ala Gin Ser Val Ser Ser 
20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

30 5 0 5 5 6 0 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 
65 70 75 80 



35 



40 



Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 

His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 
130 135 140 

45 Gly Cys 

145 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: protein 



SO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys Thr 
15 10 15 

lie Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro He 
35 40 45 . 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 
50 55 60 . 

15 Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

65 70 75 80 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
85 90 95 



20 



25 



30 



35 



40 



45 



His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
115 120 125 

Leu Gin Gly Ser Leu Gin. Asp Met Leu Gin Gin Leu Asp Leu Ser. Pro 
130 135 140 

Gly Cys 
145 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRI PTION : . SEQ ID N0:12: 

Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 
1 5 . 10 15. 

He Vai Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

20 25 . 30 

50 Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 
50 55 60 

55 
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Leu Thr Ser Met Pro Ser Arg Asn Met lie Gin lie Ser Asn Asp Leu" 
65 70 75 80 

g Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

85 90 .95 

Ser Leu Pro Gin Thr Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 
100 105 110 

70 Va] Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 
130 135 140 

IS Gly Cys 

145 

(2) INFORMATION FOR SEQ ID NO: 13: 



20 



2S 



30 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

15 10 15 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 
20 25 30 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro lie 
35 40 45 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

50 55 50 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 
65 70 75 80 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser CVs 
85 90 .95 

Ser Leu Pro Gin Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

100 105 110 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 
so 115 120 125 

Leu Gin Gly Ser Leu Gin Asp Met Leu Gin Gin Leu Asp Leu Ser Pro 
130 135 140 

Gly Cys 
55 145 
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Claims 

1 . An obesity protein of the formula: 



5 . 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

10 He Val Thr Arg He Asn Asp He Ser His Thr Xaa Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

IS Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 . 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

20 Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 



25 



115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys (SEQ ID NO: 1) 

■ wherein: 

35 Xaa at position 28 is Gin or absent; 

said protein having at least one of the following substitutions: 

Gin at position 4 is replaced with Glu; 
40 Gin at position 7 is replaced with Glu; 

Asn at position 22 is replaced with Gin or Asp; 

Thr at position 27 is replaced with Ala; 

Xaa at position 28 Is replaced with Glu; 

Gin at position 34 is replaced with Glu; 
45 Met at position 54 is replaced with methionine sulfoxide, Leu, lie. Val, Ala, or Gly; 

Gin at position 56 is replaced with Glu; 

Gin at position 62 is replaced with Glu; 

Gin at position 63 is replaced with Glu; 

Met at position 68 is replaced with methionine sulfoxide, Leu, lie, VSat, Ala, or Gly; 
so Asn at position 72 is replaced with Gin, Glu, or Asp; 

Gin at position 75 is replaced with Glu; 

Ser at position 77 is replaced with Ala; 

Asn at position 78 is replaced with Gin or Asp; 

Asn at position 82 is replaced with Gin or Asp; 
55 His at position 97 is replaced with Gin, Asn, Ala, Gly, Ser, or Pro; 

Trp at position 100 is replaced with Ala, Glu, Asp. Asn. Met. lie, Phe, Tyr, Ser, Thr, Gly, Gin, \^l or Leu; 

Ala at position 101 is replaced with Ser, Asn, Gly, His, Pro, Thr, or Val; 

Ser at position 102 is replaced with Arg; 
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Gly at position 103 is replaced with Ala; 

Glu at position 1 05 is replaced with Gin; 

Thr at position 106 is replaced with Lys or Ser; 

Leu at position 107 is replaced with Pro; 
5 Asp at position 108 is replaced with Glu; 

Gly at position 111 is replaced with Asp; 

Gly at position 118 is replaced with Leu; 

Gin at position 1 30 is replaced with Glu; 

Gin at position 1 34 is replaced with Glu; 
10 Met at position 136 is replaced with methionine sulfoxide, Leu, lie, Val, Ala, or Gly; 

Trp at position 138 is replaced with Ala, Glu, Asp, Asn, Met, lie, Phe, Tyr, Ser, Thr, Gly, Gin. Val or Leu; or 

Gin at position 1 39 is replaced with Glu; 

or a pharmaceutically acceptable salt thereof. 

15 

2. An obesity protein of Claim 1 selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, 
SEQ ID NO: 5. SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 
SEQ ID NO: 12, and SEQ ID NO: 13. 

20 3. A soluble parenteral formulation, comprising an obesity protein of Claim 1 and a presen/atlve selected from the 
group consisting of alkylparaben, chlorobutanol, or a mixture theriaof. 

4. A formulation of Claim 3. wherein the presen/ative is methylparaben, ethylparaben. propylparaben, or butylpa- 
raben. 

25 

5. A formulation of Claim 4, wherein the concentration of obesity protein Is about 1 .0 mg/mL to about 100 mg/mL. 

6. A formulation of Claim 4, wherein the concentration of obesity protein is about 5.0 mg/mL to about 50.0 mg/mL. 
30 7. A formulation of Claim 6, which further comprises an isotonicity agent. 

8. A formulation of Claim 7, which further comprises a physiologically acceptable buffer. 

9. A formulation of Claim 8, wherein the concentration of obesity protein is about 10 mg/mL, the presen/ative is 
35 methylparaben, the isotonicity agent is glycerin at about 16 mg/mL, and the physiologically acceptable buffer is 

disodium phosphate at about 14 mM, and which further comprises about 0.1% EDTA phosphate. 

10. A formulation of any one of Claims 3 through 9, wherein the obesity protein is a protein of SEQ ID NO: 2: 

40 



45 



so 



55 
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10 



IS 



(SEQ ID NO: 2) 

5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asp Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 

Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

Leu Thr Ser Met Pro Ser Arg Asn Val He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys 

said protein having a di-sulflde bond between Cys at position 96 and Cys at position 146; or pharmaceutically 
30 acceptable salts thereof. 

11 . A formulation of any one of Claims 3 through 9. wherein the obesity protein is a protein of SEQ ID NO: 6: 



20 



25 



35 



40 



45 



(SEQ ID NO: 6) 

5 10 15 

Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu He Lys Thr 

20 25 30 

He Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser Ser 

35 40 45 

Lys Gin Lys Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro He 

50 55 60 



so 



55 
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Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin He 

65 70 75 80 

5 Leu Thr Ser Met Pro Ser Arg Asn Met He Gin He Ser Asn Asp Leu 

85 90 95 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 

100 105 110 

10 His Leu Pro Ala Ala Ser Gly Leu Glu Thr Leu Asp Set Leu Gly Gly 

115 120 125 

Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu Ser Arg 

130 135 140 

IS Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu Asp Leu Ser Pro 

145 

Gly Cys 

20 said protein having a di-sutfide bond between Cys at position 96 and Cys at position 146; or pharmaceutically 

acceptable salts thereof. 

12. Use of a protein of Claim 1 for the manufacture of a medicament for the treatment of obesity. 

25 13. Use of a protein selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 
5. SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8. SEQ ID NO: 9, SEQ ID NO: 10. SEQ ID NO: 11, SEQ ID NO: 12, 
and SEQ ID NO: 1 3 for the manufacture of a medicament for the treatment of obesity. 
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